GTCATGAAAT TGGAATCTGA CAAGACGTTC CCAATCATGT TGGAAGGGAA
GATAAACGGC TACGCTTGTG TGGTCGGAGG GAAGTTATTC AGGCCGATGC
ATGTGGAAGG CAAGATCGAC AACGACGTTC TGGCCGCGCT TAAGACGAAG
AAAGCATCCA AATACGATCT TGAGTATGCA GATGTGCCAC AGAACATGCG
GGCCGATACA TTCAAATACA CCCATGAGAA ACCCCAAGGC TATTACAGCT
GGCATCATGG AGCAGTCCAA TATGAAAATG GGCGTTTCAC GGTGCCGAAA
GGAGTTGGGG CCAAGGGAGA CAGCGGACGA CCCATTCTGG ATAACCAGGG
ACGGGTGGTC GCTATTGTGC TGGGAGGTGT GAATGAAGGA TCTAGGACAG
CCCTTTCAGT CGTCATGTGG AACGAGAAGG GAGTTACCGT GAAGTATACT
CCGGAGAACT GCGAGCAATG GTAATGA



VMKLESDKTF PIMLEGKING YACVVGGKLF RPMHVEGKID NDVLAALKTK
KASKYDLEYA DVPQNMRADT FKYTHEKPQG YYSWHHGAVQ YENGRFTVPK
GVGAKGDSGR PILDNQGRVV AIVLGGVNEG SRTALSVVMW NEKGVTVKYT
PENCEQW
Adaptein-1 nucleotide sequence:



GTCATGAAAT TGGAATCTGA CAAGACGTTC CCAATCATGT TGGAAGGGAA
GATAAACGGC TACGCTTGTG TGGTCGGAGG GAAGTTATTC AGGCCGATGC
ATGTGGAAGG CAAGATCGAC AACGACGTTC TGGCCGCGCT TAAGACGAAG
AAAGCATCCA AATACGATCT TGAGTATGCA GATGTGCCAC AGAACATGCG
GGCCGATACA TTCAAATACA CCCATGAGAA ACCCCAAGGC TATTACAGCT
GGCATCATGG AGCAGTCCAA TATGAAAATG GGCGTTTCAC GGTGCCGAAA
GGAGTTGGGG CCAAGGGAGA CAGCGGACGA CCCATTCTGG ATAACCAGGG
ACGGGTGGTC GCTATTGTGC TGGGAGGTGT GAATGAAGGA TCTAGGACAG
CCCTTTCAGT CGTCATGTGG AACAAGCTTT CTCCACATTA TGCTCAACTC
GAGGGAGTTA CCGTGAAGTA TACTCCGGAG AACTGCGAGC AATGGTAATG
AGC



Adaptein-2 nucleotide sequence:

GTCATGAAAT TGGAATCTGA CAAGACGTTC CCAATCATGT TGGAAGGGAA
GATAAACGGC TACGCTTGTG TGGTCGGAGG GAAGTTATTC AGGCCGATGC
ATGTGGAAGG CAAGATCGAC AACGACGTTC TGGCCGCGCT TAAGACGAAG
AAAGCATCCA AATACGATCT TGAGTATGCA GATGTGCCAC AGAACATGCG
GGCCGATACA TTCAAATACA CCCATGAGAA ACCCCAAGGC TATTACAGCT
GGCATCATGG AGCAGTCCAA TATGAAAATG GGCGTTTCAC GGTGCCGAAA
GGAGTTGGGG CCAAGGGAGA CAGCGGACGA CCCATTCTGG ATAACCAGGG
ACGGGTGGTC GCTATTGTGC TGGGAGGTGT GAATGAAGGA TCTAGGACAG
CCCTTTCAGT CGTCATGTGG AACAAGCTTA GAAGCGGTAC TCAATGGCTC
GAGGGAGTTA CCGTGAAGTA TACTCCGGAG AACTGCGAGC AATGGTAATG
AGC



Adaptein-1 protein sequence:

VMKLESDKTF PIMLEGKING YACVVGGKLF RPMHVEGKID NDVLAALKTK
KASKYDLEYA DVPQNMRADT FKYTHEKPQG YYSWHHGAVQ YENGRFTVPK
GVGAKGDSGR PILDNQGRVV AIVLGGVNEG SRTALSVVMW NKLSPHYAQL
EGVTVKYTPE NCEQW

Adaptein-2 protein sequence:

VMKLESDKTF PIMLEGKING YACVVGGKLF RPMHVEGKID NDVLAALKTK
KASKYDLEYA DVPQNMRADT FKYTHEKPQG YYSWHHGAVQ YENGRFTVPK
GVGAKGDSGR PILDNQGRVV AIVLGGVNEG SRTALSVVMW NKLRSGTQWL
EGVTVKYTPE NCEQW
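
The nucleotide and protein listings above can be cross-checked against each other by translating the DNA in frame. The following is a minimal sketch, assuming the standard genetic code and a reading frame that starts at the first base of each listing; the names `CODON_TABLE` and `translate` are illustrative and not part of the original document.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # first base T
         "LLLLPPPPHHQQRRRR"   # first base C
         "IIIMTTTTNNKKSSRR"   # first base A
         "VVVVAAAADDEEGGGG")  # first base G
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(nucleotides: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the listings) is ignored.
    A trailing partial codon is dropped; a codon containing a character
    other than A, C, G, T raises KeyError.
    """
    seq = "".join(nucleotides.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino = CODON_TABLE[seq[i:i + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First 30 bases shared by all three nucleotide listings.
print(translate("GTCATGAAAT TGGAATCTGA CAAGACGTTC"))
# In-frame stretch of the Adaptein-1 listing around the HindIII/XhoI region.
print(translate("AACAAGCTTT CTCCACATTA TGCTCAACTC GAG"))
```

Run on a full nucleotide listing, the result should reproduce the corresponding protein listing if the transcription is correct, which makes it a quick way to spot single-base transcription errors.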
Alignment of adaptein nucleotide sequences with CCD sequence:

A-1 GTCATGAAAT TGGAATCTGA CAAGACGTTC CCAATCATGT TGGAAGGGAA
A-2 GTCATGAAAT TGGAATCTGA CAAGACGTTC CCAATCATGT TGGAAGGGAA
CCD GTCATGAAAT TGGAATCTGA CAAGACGTTC CCAATCATGT TGGAAGGGAA

A-1 GATAAACGGC TACGCTTGTG TGGTCGGAGG GAAGTTATTC AGGCCGATGC
A-2 GATAAACGGC TACGCTTGTG TGGTCGGAGG GAAGTTATTC AGGCCGATGC
CCD GATAAACGGC TACGCTTGTG TGGTCGGAGG GAAGTTATTC AGGCCGATGC

A-1 ATGTGGAAGG CAAGATCGAC AACGACGTTC TGGCCGCGCT TAAGACGAAG
A-2 ATGTGGAAGG CAAGATCGAC AACGACGTTC TGGCCGCGCT TAAGACGAAG
CCD ATGTGGAAGG CAAGATCGAC AACGACGTTC TGGCCGCGCT TAAGACGAAG

A-1 AAAGCATCCA AATACGATCT TGAGTATGCA GATGTGCCAC AGAACATGCG
A-2 AAAGCATCCA AATACGATCT TGAGTATGCA GATGTGCCAC AGAACATGCG
CCD AAAGCATCCA AATACGATCT TGAGTATGCA GATGTGCCAC AGAACATGCG

A-1 GGCCGATACA TTCAAATACA CCCATGAGAA ACCCCAAGGC TATTACAGCT
A-2 GGCCGATACA TTCAAATACA CCCATGAGAA ACCCCAAGGC TATTACAGCT
CCD GGCCGATACA TTCAAATACA CCCATGAGAA ACCCCAAGGC TATTACAGCT

A-1 GGCATCATGG AGCAGTCCAA TATGAAAATG GGCGTTTCAC GGTGCCGAAA
A-2 GGCATCATGG AGCAGTCCAA TATGAAAATG GGCGTTTCAC GGTGCCGAAA
CCD GGCATCATGG AGCAGTCCAA TATGAAAATG GGCGTTTCAC GGTGCCGAAA

A-1 GGAGTTGGGG CCAAGGGAGA CAGCGGACGA CCCATTCTGG ATAACCAGGG
A-2 GGAGTTGGGG CCAAGGGAGA CAGCGGACGA CCCATTCTGG ATAACCAGGG
CCD GGAGTTGGGG CCAAGGGAGA CAGCGGACGA CCCATTCTGG ATAACCAGGG

A-1 ACGGGTGGTC GCTATTGTGC TGGGAGGTGT GAATGAAGGA TCTAGGACAG
A-2 ACGGGTGGTC GCTATTGTGC TGGGAGGTGT GAATGAAGGA TCTAGGACAG
CCD ACGGGTGGTC GCTATTGTGC TGGGAGGTGT GAATGAAGGA TCTAGGACAG

                             (HindIII)                   (XhoI)
A-1 CCCTTTCAGT CGTCATGTGG AAC AAGCTT TCTCCACATTA TGCTCAA CTCGA
A-2 CCCTTTCAGT CGTCATGTGG AAC AAGCTT AGAAGCGGTAC TCAATGG CTCGA
CCD CCCTTTCAGT CGTCATGTGG AACGAG

A-1 --GGGAGTTA CCGTGAAGTA TACTCCGGAG AACTGCGAGC AATGGTAATGAGC
A-2 --GGGAGTTA CCGTGAAGTA TACTCCGGAG AACTGCGAGC AATGGTAATGAGC
CCD AAGGGAGTTA CCGTGAAGTA TACTCCGGAG AACTGCGAGC AATGGTAATGAGC
Alignment of adaptein protein sequences with CCD sequence:

A-1 VMKLESDKTF PIMLEGKING YACVVGGKLF RPMHVEGKID NDVLAALKTK
A-2 VMKLESDKTF PIMLEGKING YACVVGGKLF RPMHVEGKID NDVLAALKTK
CCD VMKLESDKTF PIMLEGKING YACVVGGKLF RPMHVEGKID NDVLAALKTK

A-1 KASKYDLEYA DVPQNMRADT FKYTHEKPQG YYSWHHGAVQ YENGRFTVPK
A-2 KASKYDLEYA DVPQNMRADT FKYTHEKPQG YYSWHHGAVQ YENGRFTVPK
CCD KASKYDLEYA DVPQNMRADT FKYTHEKPQG YYSWHHGAVQ YENGRFTVPK

A-1 GVGAKGDSGR PILDNQGRVV AIVLGGVNEG SRTALSVVMW N-KLSPHYAQLE
A-2 GVGAKGDSGR PILDNQGRVV AIVLGGVNEG SRTALSVVMW N-KLRSGTQWLE
CCD GVGAKGDSGR PILDNQGRVV AIVLGGVNEG SRTALSVVMW NE----------

A-1 -GVTVKYTPE NCEQW
A-2 -GVTVKYTPE NCEQW
CCD KGVTVKYTPE NCEQW
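
The segment in which each adaptein departs from CCD can also be located without a full alignment, by trimming the residues the two sequences share at both ends. This is a rough sketch under that assumption, not an alignment algorithm, so its boundaries need not coincide with the gap placement shown in the alignment above; the function name `differing_region` is hypothetical.

```python
def differing_region(a: str, b: str) -> tuple[str, str]:
    """Return the parts of a and b left after removing their
    longest common prefix and longest common suffix."""
    limit = min(len(a), len(b))

    # Length of the shared prefix.
    prefix = 0
    while prefix < limit and a[prefix] == b[prefix]:
        prefix += 1

    # Length of the shared suffix, not allowed to overlap the prefix.
    suffix = 0
    while suffix < limit - prefix and a[-1 - suffix] == b[-1 - suffix]:
        suffix += 1

    return a[prefix:len(a) - suffix], b[prefix:len(b) - suffix]


# C-terminal stretches taken from the protein listings (spaces removed).
ccd_tail = "SRTALSVVMWNEKGVTVKYTPENCEQW"
a1_tail = "SRTALSVVMWNKLSPHYAQLEGVTVKYTPENCEQW"
a2_tail = "SRTALSVVMWNKLRSGTQWLEGVTVKYTPENCEQW"

print(differing_region(ccd_tail, a1_tail))
print(differing_region(ccd_tail, a2_tail))
```

Applied to the full-length sequences, the same call returns the same pair, since everything upstream of this stretch is identical in the three listings.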
5'atgtacggtcgtaaaaaacgtcgtcagcgtcgtcgtgtcatgaaattggaa
tctgacaagacgttcccaatcatgttggaagggaagataaacggctacgctt
gtgtggtcggagggaagttattcaggccgatgcatgtggaaggcaagatcga
caacgacgttctggccgcgcttaagacgaagaaagcatccaaatacgatctt
gagtatgcagatgtgccacagaacatgcgggccgatacattcaaatacaccc
atgagaaaccccaaggctattacagctggcatcatggagcagtccaatatga
aaatgggcgtttcacggtgccgaaaggagttggggccaagggagacagcgg
acgacccattctggataaccagggacgggtggtcgctattgtgctgggaggt
GTGAATGAAGGATCTAGGACAGCCCTTTCAGTCGTCATGTGGAACAAGCTTG
GATCTTCTCTCGAGGGAGTTACCGTGAAGTATACTCCGGAGAACTGCGAGCA
ATGGTAA3'.




MYGRKKRRQRRRVMKLESDKTFPIMLEGKINGYACVVGGKLFRPMHVEGKIDN
DVLAALKTKKASKYDLEYADVPQNMRADTFKYTHEKPQGYYSWHHGAVQYE
NGRFTVPKGVGAKGDSGRPILDNQGRVVAIVLGGVNEGSRTALSVVMWNEKGV
TVKYTPENCEQW.



